Picture for Fanyu Meng

Fanyu Meng

Where Do Deep-Research Agents Go Wrong? Span-Level Error Localization in Agent Trajectories

Add code
Jun 01, 2026
Viaarxiv icon

Structure-Guided Visual Perturbation Neutralization for LVLMs

Add code
May 27, 2026
Viaarxiv icon

JT-SAFE-V2: Safety-by-Design Foundation Model with World-Context Data

Add code
May 23, 2026
Viaarxiv icon

Strategy-Aware Optimization Modeling with Reasoning LLMs

Add code
May 04, 2026
Viaarxiv icon

DR$^{3}$-Eval: Towards Realistic and Reproducible Deep Research Evaluation

Add code
Apr 16, 2026
Viaarxiv icon

CCR-Bench: A Comprehensive Benchmark for Evaluating LLMs on Complex Constraints, Control Flows, and Real-World Cases

Add code
Mar 09, 2026
Viaarxiv icon

Beyond One-Size-Fits-All: Adaptive Subgraph Denoising for Zero-Shot Graph Learning with Large Language Models

Add code
Mar 03, 2026
Viaarxiv icon

From Helpfulness to Toxic Proactivity: Diagnosing Behavioral Misalignment in LLM Agents

Add code
Feb 04, 2026
Viaarxiv icon

Thinking-Based Non-Thinking: Solving the Reward Hacking Problem in Training Hybrid Reasoning Models via Reinforcement Learning

Add code
Jan 08, 2026
Viaarxiv icon

RECALLED: An Unbounded Resource Consumption Attack on Large Vision-Language Models

Add code
Jul 24, 2025
Viaarxiv icon